A Fast Algorithm for Aperiodic Linear Stencil Computation using Fast Fourier Transforms

نویسندگان

چکیده

Stencil computations are widely used to simulate the change of state physical systems across a multidimensional grid over multiple timesteps. The state-of-the-art techniques in this area fall into three groups: cache-aware tiled looping algorithms, cache-oblivious divide-and-conquer trapezoidal and Krylov subspace methods. In paper, we present two efficient parallel algorithms for performing linear stencil computations. Current direct solvers domain computationally inefficient, methods require manual labor mathematical training. We solve these problems stencils by using DFT preconditioning on method achieve solver which is both fast general. Indeed, while all currently available solving general perform Θ ( NT ) work, where N size spatial T number timesteps, our o work. To best knowledge, give first that use Fourier transforms compute final data evolving initial many timesteps at once. Our handle periodic aperiodic boundary conditions, polynomially better performance bounds (i.e., computational complexity runtime) than other existing solutions. Initial experimental results show implementations evolve grids roughly 10 7 cells around 5 run orders magnitude faster problems, 1.3 × 8.5 problems. Code Repository: https://github.com/TEAlab/FFTStencils

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast Computation of Partial Fourier Transforms

We introduce two efficient algorithms for computing the partial Fourier transforms in one and two dimensions. Our study is motivated by the wave extrapolation procedure in reflection seismology. In both algorithms, the main idea is to decompose the summation domain of into simpler components in a multiscale way. Existing fast algorithms are then applied to each component to obtain optimal compl...

متن کامل

Fast Fourier Transforms

29 O(b log(b)) operations (using standard multiplication). As there are O(b= log(b)) primes in total, the running time of this stage of the algorithm is O(b 2 L), even using the \grammar school" method of integer multiplication. At this stage of the algorithm we have obtained a vector of length L whose entries are integral linear combinations of powers of with coeecients bounded by M in absolut...

متن کامل

Dimensionless Fast Fourier Transforms

This paper shows that there are fast Fourier transform (FFT) algorithms that work, for a fixed number of points, independent of the dimension. Changing the dimension is achieved by relabeling the input and the output and changing the “twiddle factors.” An important consequence of this result, is that a program designed to compute the 1-dimensional Fourier transform can be easily modified to com...

متن کامل

Fourier Transforms and the Fast Fourier Transform ( FFT ) Algorithm

and the inverse Fourier transform is f (x) = 1 2π ∫ ∞ −∞ F(ω)e dω Recall that i = √−1 and eiθ = cos θ+ i sin θ. Think of it as a transformation into a different set of basis functions. The Fourier transform uses complex exponentials (sinusoids) of various frequencies as its basis functions. (Other transforms, such as Z, Laplace, Cosine, Wavelet, and Hartley, use different basis functions). A Fo...

متن کامل

Fast Computation of Voigt Functions via Fourier Transforms

This work presents a method of computing Voigt functions and their derivatives, to high accuracy, on a uniform grid. It is based on an adaptation of Fourier-transform based convolution. The relative error of the result decreases as the fourth power of the computational effort. Because of its use of highly vectorizable operations for its core, it can be implemented very efficiently in scripting ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Transactions on Parallel Computing

سال: 2023

ISSN: ['2329-4949', '2329-4957']

DOI: https://doi.org/10.1145/3606338